Extracting generic basis of association rules from SAGE data
نویسندگان
چکیده
Applying classical association rule extraction framework to dense SAGE data leads to an unmanageably highly sized association rule sets– compounded with their low precision– that often make the perusal of knowledge ineffective, their exploitation time-consuming, and frustrating for the user. To overcome such drawback, we advocate the extraction and exploitation of compact and informative generic basis of association rules. Obtained preliminary results highlight that the extracted correlations may be of help in identifying a gene functional groups and thus contribute to their annotation. From a biologic point of view, such identification may be a powerful verification technique for hampering gene mis-annotating or badly clustering in the Unigene library.
منابع مشابه
EGEA : A New Hybrid Approach Towards Extracting Reduced Generic Association Rule Set (Application to AML Blood Cancer Therapy)
To avoid obtaining an unmanageable highly sized association rule sets– compounded with their low precision– that often make the perusal of knowledge ineffective, the extraction and exploitation of compact and informative generic basis of association rules is a becoming a must. Moreover, they provide a powerful verification technique for hampering gene mis-annotating or badly clustering in the U...
متن کاملA new generic basis of "factual" and "implicative" association rules
The extremely large number of association rules that can be drawn from – even reasonably sized datasets, bootstrapped the development of more acute techniques or methods to reduce the size of the reported rule sets. In this context, the battery of results provided by the Formal Concept Analysis (FCA) allowed one to define “irreducible” nuclei of association rule subset better known as generic b...
متن کاملPrediction of chronic kidney disease in Isfahan with extracting association rules using data mining techniques
Background: Millions of deaths occur around the world each year due to lack of access to appropriate treatment for chronic kidney disease patients. Given the importance and mortality rate of this disease, early and low-cost prediction is very important. The researchers intend to identify chronic kidney disease through the optimal combination of techniques used in different stages of data mining...
متن کاملیافتن الگوهای مکرّر در قرآن کریم بهکمک روشهای متنکاوی
Quran’s Text differs from any other texts in terms of its exceptional concepts, ideas and subjects. To recognize the valuable implicit patterns through a vast amount of data has lately captured the attention of so many researchers. Text Mining provides the grounds to extract information from texts and it can help us reach our objective in this regard. In recent years, Text Mining on Quran and e...
متن کاملAvoiding the itemset closure computation ”pitfall”
Extracting generic bases of association rules seems to be a promising issue in order to present informative and compact user addedvalue knowledge. However, extracting generic bases requires partially ordering costly computed itemset closures. To avoid the nightmarish itemset closure computation cost, specially for sparse contexts, we introduce an algorithm, called Prince, allowing an astute ext...
متن کامل